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We consider the large deviations of the smallest eigenvalue of the Wishart-Laguerre Ensemble. 
Using the Coulomb gas picture we obtain rate functions for the large fluctuations to the left and the 
right of the hard edge. Our findings are compared with known exact results for 13 = 1 finding good 
agreement. We also consider the case of almost square matrices finding new universal rate functions 
describing large fluctuations. 
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In the early 50s, when not much was known about the 
intricacies of complex atomic nuclei, Wigner suggested to 
replace the underlying physics of the problem by its ap- 
parent statistical features [l| . Surprisingly, it turned out 
that such statistical approach was more useful than any- 
one could have anticipated, enabling him to work out the 
nuclei level spacing distribution. Random Matrix Theory 
(RMT) has played a central role in various branches of 
science, since its first appearance in Statistics by Wishart 
in 1928 y, through QCD [3|, random graphs [J|, wire- 
less communications [5] and computational biology [6| - 
to mention a few. And although through more than half 
a century different, seemingly unrelated, problems have 
been linked via RMT, certain questions about eigenvalue 
distributions are still poorly explored. One such question 
is the distribution of the smallest eigenvalue. 
In physics, for instance, classical disordered systems offer 
the ideal context where RMT concepts and tools may be 
applied. Here, complicated systems are simplified by us- 
ing a random Hamiltonian, where the smallest eigenvalue 
is of special interest as it is associated with the ground 
state. In quantum entanglement, the smallest eigenvalue 
is a useful measure of entanglement [7'| . 
In mathematics, the minimal eigenvalue appears natu- 
rally in many contexts, such as the study of the geometry 
of random polytopes [8] . It is also extremely relevant for 
the question of invertibility of random matrices [9|] j a-nd 
recently played an important role in the exploding field of 
compressive sensing |10| , where fluctuations of the mini- 
mal eigenvalue set the bounds on the number of random 
measurements needed to fully recover a sparse signal. 
In statistics, a very important technique used to detect 
hidden patterns in complex, high-dimensional datasets 
is called "Principal Components Analysis" (PCA). The 
idea is to take a data matrix X, and to transform its 
covariance matrix W — X^ X into a new coordinate sys- 
tem such that the greatest variance by any projection of 
the data comes to lie on the first coordinate. Technically 
speaking, one identifies eigenvalues and eigenvectors of 
W , and ignores the components corresponding to the 
lowest eigenvalues, as these eigenmodes contain the least 
important information. The smallest eigenvalue of W de- 
termines the largest eigenvalue of W^^ , and is important 



in Hotelling's T-square distribution ll|, for example. 



The purpose of this work is to provide a simple physical 
method, based on the Coulomb gas method in statisti- 
cal physics 12|, that allows us to compute analytically 



the probability of the smallest eigenvalue in the Wishart- 
Laguerre ensemble. 

We consider an ensemble G{M, N) oi AI x N rectangular 
matrices X S G{M,N), which are drawn from a Gaus- 



sian distribution P{X) oc exp 



2 ^^ 



(x^x) 



is the Dyson index with classical values /3 



where /3 
1,2, and 

4, corresponding to the real, complex and quaternionic 
cases respectively. The Wishart ensemble W{M, N) is 
defined as the set oi N x N matrices W — X^X. If 
A denotes the N eigenvalues of a Wishart matrix, their 
joint PDF reads 



P{\) 
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Z(0) 



-/3F(A)/2 
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with Z{0) a normalisation constant and F{\) defined as: 



N 



N 



F{X) = 51 ^» - " E ln(AO - Y. 1'^ l^» - ^1 



(2) 



i=l 



iy^J 



where u^l + M-N-j. 

We will restrict ourselves to the case M > N. It is well 
known (l3| that for large A^ the density of eigenvalues 
Pat (A) = jffi^) is given by the Marcenko-Pastur law: 



fi^) = ^V(^-C-)(C+ 



^)'^xe[C-X+] (3) 



= i^ and c = ^. The 



with C± = (1 ± \/T+a)2 
dicator Ix^D takes the value 1 ii x £ D and otherwise. 
The points (- and (+ are usually called the hard edge 
and the soft edge of the distribution ([3]), respectively. 
The case a = (or M = N) corresponds to square ma- 
trices, and the case a = 0{1/N) (or M - TV = 0(1)) is 
referred to here as " almost square matrices" . 
We will focus on the statistical fluctuations around the 
hard edge, which are captured by the probability distri- 
bution of the smallest eigenvalue ppf^ W, from which 
any other statistical property of the hard edge may be in- 
ferred. A classical result (l4| shows that Amin converges 



almost surely to C_ as A^ — ?► cxd. However, the concen- 
tration of the minimum around this value is generally 
unknown, and can be a challenging task. 
The distribution of the smallest eigenvalue p)^'"'(A) (as 
well as for the maximal eigenvalue) has been formally 
expressed using Multivariate Hypergeometric Functions 
and Zonal Polynomials. This was first done long ago for 
real matrices {/3 = 1) in 11 511 . then generalised to com- 
plex matrices (/? = 2) in [l6| and only recently to any 



In this framework Z{t) is understood as the partition 
function of a 2D Coulomb gas of charged particles re- 
stricted on a ID line, in an external linear-log potential 
and a hard wall at t. The idea, as in 12|, is to evaluate 
Z(t) by using the saddle-point approximation. It is im- 
portant to notice that the expression ([5]) is exact and that 
while the saddle-point method provides the exact density 
of eigenvalues for large N, it is only able to capture the 
large deviations to the right of the smallest eigenvalue 
(3 in [17[. These expressions are rigorous but often not from p]^"'"'^(A). This is not entirely surprising as pnW 

is a collective quantity while p]^'" (A) is not. 
The calculation goes along similar lines as in [2J, |25j, 
so we shortly describe the needed steps. To apply the 
saddle-point method to (O , one first introduces the func- 
tion g{X) = -^ J2i=i (5 (A — Ai). This allows us to write 
the Z{t) as a path integral over g{X) and its Lagrange 
multiplier ^(A). Minimising the corresponding functional 
with respect to p(A), allows to eliminate this multiplier 
and to unveil the meaning of £i(A) as a constrained spec- 
tral density. After rescaling the eigenvalues (A — xN and 
C = t/N), Z{t) takes the form 



easy to evaluate and manipulate (although a real break- 
through has occurred recently with a new algorithm that 
can calculate such quantities with complexity that is only 
linear in the size of the matrix [18| , as well as available 
dedicated packages [19|). 

Another line of explicit expressions involve a determi- 
nantal representation of the distribution of the smallest 
eigenvalue (as well as other order statistics) for finite N 
and M (see [20| and references therein). These expres- 
sion are not always easy to implement, and expecially 
when M — A^ is large. 

More explicit expressions have been derived by Edelman 
in [m, [23 for /3 = 1 (for /3 = 2 as well, but only for 
square matrices). These expressions require the knowl- 
edge of some polynomials (different ones for any M and 
A^). While manageable and useful for small matrices, this 
turns out to be impractical for large matrices, since no ex- 
plicit formula exists for these polynomials. It is especially 
changeling to extract other relevant statistical properties 
around the hard edge like, for instance, the distribution 
of the typical (order 0{Np) with < p < 1) or large 
(order 0{N)) fluctuations around their mean values. 
The typical fluctuations of the smallest eigenvalue have 
been recently studied rigorously in [23| , where it is shown 
that the smallest eigenvalue Amin follows the Tracy- 
Widom (TW) distribution, that is, the typical fluctua- 
tions of Amin can be expressed as 



Zit) - / {Vf}e 



-§N^S[f{x)] 



(7) 



with the rescaled density g{X) — jj-fi^), and the action 



S[f{x)]^J dxf(x)x-(a + ^^ j dxf{x)\n: 



00 /'OO 



dxdx' f{x)f{x') In \x — x'\ 



(8) 



dxf{x) In f{x) + Ci / dxf{x) - 1 



X^,^^C-N-eJ'c^"N^"x 



(4) 



where x follows a TW distribution g^ix) [23]. This result 
was proven for the case c < 1 in the large A^-limit, and 
so strictly speaking, it does not apply to either square or 
almost square matrices. 

The knowledge about large deviations of the smallest 
eigenvalue is, as far as we are aware of, mainly unex- 
plored. We would like here to correct the situation. To 
do so we will use the Coulomb gas approach [12]. Starting 
with Eq. (HI the cumulative probability of the minimum 
Pn \^) = ^('^min > t) can be written as 

4"'"^w= rrfvi,"^'"^(A)=ii (5) 



2 
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As we will apply the saddle-point method we neglect the 
terms which are of order 0{N^^) in the preceding ex- 
pression for S[f{x)]. Note however that while the first 
term in the subleading correction can be easily kept, the 
entropic term makes the evaluation of the saddle-point 
equation a changeling task. The next step is to look for 
the saddle point of the action. The saddle point equation 
is basically SS[f{x)]/6f{x) = 0. It is useful to differenti- 
ate the saddle-point equation once with respect to x: 



a 
2x 



P 



M^) 



dx' , a; e [C, 00) 



(9) 



where /*(a;) is the value of f{x) at the saddle-point. This 
is a Tricomi integral equation, and is solved as in [2J] , so 
we report the final result: For ( e [0, C-], f*{x) is given 
by Eq. ©, while for C G [C-, 00), 



M^) 



VIT^ x~ay^C/U 



Z(0) 



27rV2r~C 



r^eicu] 



with 



(<)= / ••• / dAi---dAwe-^^(^)/2 (6) 
Jt Jt 



Z(t) = 



with U = U{c, C) = w^{c, C) and w{c, () given by 

2p f9 + 2n 

^(^'O^^^cos 



(10) 



(11) 



with p= -[C + 2{a + 2)], q = 2aVC, P = '\J-§j, = 
atan (^) and B = - ff^ + ^V Using these resuhs 
and after a long and tedious calculation we obtain the 
following results for the distribution: for t e [0, A^C-] we 



have Pj 



(min) 



j(min; 

N 



pr'"{t) 



^ (t) = 1, and 

e-^^'^'r"'C-'^),tem-,oo) (12) 
with the right rate function $^'" (a;) being 

<i>^r'\x) = \[S{x + C-)-S{C-)], xe[0,^) (13) 

and the action S{() = S[f^,{x)] given by 
C + U A2 



^(C) 



32 



In 



+ — In {(U) - a{a + 2) In 



(T)-f(^-v^ 

u + VC 



(14) 



where, A = U{c, () — (. As mentioned above, the saddle- 
point approximation is only able to capture the large 
fluctuations to the right of Amin as it has a hard wall 
on the left side ([T2|) . Fortunately, the authors of [26| 
came up with a beautiful physical argument to overcome 
this shortcoming and to estimate in their case the large 
deviations from the maximal eigenvalue. Applied to the 
fluctuation to the left of Amin this method yields 



-,(min) 

N 



Plr '{t) 



-^Af* 



„) f NC_-t^ 



with the left rate function cj)*^'" 

-(min 



$^""'^(a;) 
-2 In 



■In 






', te[0,NC-] (15) 

(a:) given by: 
(l - ^) - iV^v/^+AT (16) 

a In I 1 + 2, 



y/x + A^-^ 



A_ 



with X e [0, C-] and A_ = C+ - C- = Wa + 1. 
We now check that the smallest large fluctuations pre- 
dicted by our results match the largest typical fluctua- 
tions given by the TW distribution [23[. Expanding the 
rate functions $i^™ (x) around x = gives: 
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(min) 



$ 



(min) 



(x) 
(x) 



.0 3C-ci/4^ 



r3/2 



^0 24C^VS 



(17) 

(18) 



-,(min). 



which yields the following expression of P^ (i) for the 
smallest large fluctuations of Amin^ 



P, 



(min) 



N 



(t) 



{-fx'^'it)) 



exp(-^X"'""WJ , te[0,NC-] 
exp(-^|x(t)p), ie[iVC-,oo) 



with x{t) = -{NC- ~ t)l{N^I'^CJ.^c^l'^). This result ob- 
viously agrees with TW distribution for large |x| [231.127 [ 



For larger atypical fluctuations, we have the following 
asymptotic behaviours of the rate functions: 
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K- 
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-fln(C- 



x) + o(C 



ln(c) 
2c 



a In a 



$1 \X) 



f Inx- 



C- 



x) 

3 _ 
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S(C- 



(19) 



(20) 



Interestingly, these results can be compared with the ex- 
act asymptotic behaviours predicted in 22| for the case 
/3 = 1 and large A''. It turns out that for $_ "'(x) the 
leading logarithmic behaviour agrees perfectly with [22], 
while the constant term cannot be rigorously compared, 
since it is not available for any N in the exact treatment 
|22| (although we know that a constant term exists). For 
(Ijiminj (x) we again see perfect agreement with the linear 
and logarithmic terms, but the constant deviates from 
the exact one given in 122]. 

A comparison of these results, as well as with results of 
simulations and the TW distribution (28| are summarized 
in Fig.^ Note that exact results are only available for 
/3 = 1 [22|, and so a comparison to exact results for en- 
sembles other than the real Wishart case is not possible. 



800 




FIG. 1. (color online). Results for the smallest eigenvalue 



distribution —hiP^ if) vs. the scaled variable tjN. Here, 
A^ = 11 and M — 110 (c = 1/10), Wishart matrices are real 
(/? = 1). The large deviation functions (dashed lines) compare 
very well with the exact result of 22] (solid line), while TW 
(dotted line) describes only small fluctuations. 



We now pay special attention to the case of almost square 
matrices, namely when M = N + a with a an integer of 
order unity (i.e. a — a/N, so a is of order 1/N). Here 
it is useful to use the scaling z — Nt — N'^( to unravel 
non-trivial results. Using the Coulomb gas approach we 
flnd that in this special case, the PDF of Amin does not 
simpl y ap proach a delta function, in the large N limit, 
as in [ij]. Instead, the cumulative distribution of z has 
a TV-independent limiting shape as shown in Fig. [5] for 
/3 = 1, and whose large fluctuations are described by the 



Coulomb gas prediction: 



P; 



(niin) 



A^ 



(^) 



exp ( —/3a^ 



(min) 



(!t) 



exp -/3a2vl/f'"^(f|) 
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4V^+3 






'^ V^; are universal 
TV, /3 and a-independent 
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z e [0, aV4] 

z e [0^/4,00) 

(21) 

-|-lna;)/8 and 

The functions 



in the sense that they are 
Note that in this particular 
regime one needs to keep the first 1/iV term appearing 
([8]) as it is of the same order as a = ajN . This can 
be accounted for easily by replacing in the preceding 
expressions a — > a(/3) = a + (/3 — 2)//3, as long as a(/3) 
is non-negative. To our knowledge, this is the first time 
that this case is discussed and shown to be universal 
(even numerically), and obviously no general explicit 
predictions for its shape have been proposed, apart of 
the particular case of aifi) — for which Eq. (|2ip yields 
P^™\z) — e"''^/^ in agreement with the exact result 
of [H (for a = /3 = 1) and [IH (for a = 0, /3 = 2). Note 
also that the results for TW distribution reported in [23[ 
do not formally apply to almost square matrices. 
In Fig. [2] we have compared our finding with Edelman's 
exact result for fl = 1, TV = 200 and M = 205. 




FIG. 2. (color online). Results for the smallest eigenvalue 
distribution — InPjS^'" (t) vs. the scaled variable tN . Here, 
A^ = 200 and a = 5, Wishart matrices are real (/3 = 1). The 
large deviation functions derived here (dashed lines) compare 
well with the exact result of [22| (solid line). 



It is important to point out that while the results in 
[231 are exact, they become difficult to evaluate for large 
values of N and a daunting task to extract exact results 
about large fluctuations for large matrices. In contrast, 
the result (PTI) provide information of large fluctuations 
for any value of /3 and a, when a(/3) > 0. 

To summarise, in this work we study the large devia- 
tion functions of the smallest eigenvalue of the Wishart- 
Laguerre Ensemble using the Coulomb gas approach. We 



obtain explicit expressions for both the right and left rate 
functions for general a. We also highlight the existence of 
a special regime for almost square matrices, where an in- 
teresting limit distribution exists, described by universal 
rate functions "Jfi^™' [x). We were able to provide predic- 
tions for a(/3) > 0, which leaves the question of a(/3) < 
open for further research. Another interesting open ques- 
tion is regarding the typical fluctuations for the smallest 
eigenvalue for square and almost square matrices, which 
are not captured by the TW distribution. 
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